WHAT IS CLAIMED IS : 

1 . A method of extracting audio for indexing of video, comprising: 
receiving video information having embedded audio information and associated 

time codes; 

capturing the embedded audio information in the video information; 

extracting a plurality of audio metadata tracks from the audio information, each 
audio metadata track having selected ones of the time codes indicative at least of start 
and stop times for the audio metadata track; 

encoding the video information; and 

accessing the encoded video information with the selected time codes of one of 
the audio metadata tracks. 

2. The method of Claim 1 , wherein the video information is received from 

an analog source. 

3. The method of Claim 2, wherein the analog source is a videotape deck. 

4. The method of Claim 2, wherein the analog source is a live satellite feed. 

5. The method of Claim 1 , wherein the video information is received from a 
digital source. 

6. The method of Claim 1 , wherein the capturing includes digitizing with 
an audio digitization devices 

7. The method of Claim 1, wherein the plurality of audio metadata tracks 
includes at least one of: keywords, speech-to-text transcription, speaker identification 
and audio class. 

& The method of Claim 1 , wherein the time codes comprise SMPTE codes. 

9. The method of Claim 1 , wherein the encoding comprises encoding with 

an MPEG format 

1 0. An audio engine for extracting metadata tracks, comprising: 
an audio signal switch receiving an audio signal; 

an audio classification component controlling the audio signal switch according 
to whether the audio signal is classified as speech; and 



-25- 



a plurality of audio metadata track extraction components in data 
communication with the output of the switch, wherein each audio metadata track 
extraction component provides an audio metadata track associated with speech. 

1 1 . The audio engine of Claim 1 0, additionally comprising: 

an audio capture component for capturing and digitizing an analog audio source; 

and 

an audio signal normalization component for normalizing the digitized audio 
prior to processing. 

12. The audio engine of Claim 1 0, wherein the audio metadata tracks include 
at least one of: keywords, speech-to-text transcription and speaker identification. 

1 3. The audio engine of Claim 1 0, wherein the audio classification 
component additionally classifies at least silence and music. 

1 4. The audio engine of Claim 1 0, wherein the audio metadata track 
extraction components receive data from a customizable dictionary. 

1 5. The audio engine of Claim 1 0, wherein the audio signal is received from 
a real-time source. 

1 6. The audio engine of Claim 1 0, wherein the audio signal is received from 
a digital source. 

17. The audio engine of Claim 10, wherein the audio signal is received from 
a digital camcorder. 
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